Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 10127 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.5 MiB |
| Average record size in memory | 160.0 B |
Variable types
| Numeric | 14 |
|---|---|
| Categorical | 6 |
Customer_Age is highly correlated with Months_on_book | High correlation |
Months_on_book is highly correlated with Customer_Age | High correlation |
Credit_Limit is highly correlated with Avg_Open_To_Buy | High correlation |
Total_Revolving_Bal is highly correlated with Avg_Utilization_Ratio | High correlation |
Avg_Open_To_Buy is highly correlated with Credit_Limit and 1 other fields | High correlation |
Total_Trans_Amt is highly correlated with Total_Trans_Ct | High correlation |
Total_Trans_Ct is highly correlated with Total_Trans_Amt | High correlation |
Avg_Utilization_Ratio is highly correlated with Total_Revolving_Bal and 1 other fields | High correlation |
Customer_Age is highly correlated with Months_on_book | High correlation |
Months_on_book is highly correlated with Customer_Age | High correlation |
Credit_Limit is highly correlated with Avg_Open_To_Buy | High correlation |
Total_Revolving_Bal is highly correlated with Avg_Utilization_Ratio | High correlation |
Avg_Open_To_Buy is highly correlated with Credit_Limit and 1 other fields | High correlation |
Total_Trans_Amt is highly correlated with Total_Trans_Ct | High correlation |
Total_Trans_Ct is highly correlated with Total_Trans_Amt | High correlation |
Avg_Utilization_Ratio is highly correlated with Total_Revolving_Bal and 1 other fields | High correlation |
Customer_Age is highly correlated with Months_on_book | High correlation |
Months_on_book is highly correlated with Customer_Age | High correlation |
Credit_Limit is highly correlated with Avg_Open_To_Buy | High correlation |
Total_Revolving_Bal is highly correlated with Avg_Utilization_Ratio | High correlation |
Avg_Open_To_Buy is highly correlated with Credit_Limit and 1 other fields | High correlation |
Total_Trans_Amt is highly correlated with Total_Trans_Ct | High correlation |
Total_Trans_Ct is highly correlated with Total_Trans_Amt | High correlation |
Avg_Utilization_Ratio is highly correlated with Total_Revolving_Bal and 1 other fields | High correlation |
Sex is highly correlated with Income_Category | High correlation |
Income_Category is highly correlated with Sex | High correlation |
Customer_Age is highly correlated with Dependent_Count and 1 other fields | High correlation |
Sex is highly correlated with Income_Category and 2 other fields | High correlation |
Dependent_Count is highly correlated with Customer_Age | High correlation |
Income_Category is highly correlated with Sex | High correlation |
Card_Category is highly correlated with Credit_Limit and 1 other fields | High correlation |
Months_on_book is highly correlated with Customer_Age | High correlation |
Credit_Limit is highly correlated with Sex and 3 other fields | High correlation |
Total_Revolving_Bal is highly correlated with Avg_Utilization_Ratio and 1 other fields | High correlation |
Avg_Open_To_Buy is highly correlated with Sex and 3 other fields | High correlation |
Total_Amt_Chng_Q4_Q1 is highly correlated with Total_Ct_Chng_Q4_Q1 | High correlation |
Total_Trans_Amt is highly correlated with Total_Trans_Ct | High correlation |
Total_Trans_Ct is highly correlated with Total_Trans_Amt and 1 other fields | High correlation |
Total_Ct_Chng_Q4_Q1 is highly correlated with Total_Amt_Chng_Q4_Q1 | High correlation |
Avg_Utilization_Ratio is highly correlated with Credit_Limit and 2 other fields | High correlation |
Churn is highly correlated with Total_Revolving_Bal and 1 other fields | High correlation |
Dependent_Count has 904 (8.9%) zeros | Zeros |
Contacts_Count_12_mon has 399 (3.9%) zeros | Zeros |
Total_Revolving_Bal has 2470 (24.4%) zeros | Zeros |
Avg_Utilization_Ratio has 2470 (24.4%) zeros | Zeros |
Reproduction
| Analysis started | 2021-12-30 04:17:50.958378 |
|---|---|
| Analysis finished | 2021-12-30 04:20:30.871867 |
| Duration | 2 minutes and 39.91 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
| Distinct | 45 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 46.3259603 |
| Minimum | 26 |
|---|---|
| Maximum | 73 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 79.2 KiB |
Quantile statistics
| Minimum | 26 |
|---|---|
| 5-th percentile | 33 |
| Q1 | 41 |
| median | 46 |
| Q3 | 52 |
| 95-th percentile | 60 |
| Maximum | 73 |
| Range | 47 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 8.016814033 |
|---|---|
| Coefficient of variation (CV) | 0.1730523011 |
| Kurtosis | -0.2886199153 |
| Mean | 46.3259603 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.03360501632 |
| Sum | 469143 |
| Variance | 64.26930723 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 44 | 500 | 4.9% |
| 49 | 495 | 4.9% |
| 46 | 490 | 4.8% |
| 45 | 486 | 4.8% |
| 47 | 479 | 4.7% |
| 43 | 473 | 4.7% |
| 48 | 472 | 4.7% |
| 50 | 452 | 4.5% |
| 42 | 426 | 4.2% |
| 51 | 398 | 3.9% |
| Other values (35) | 5456 |
| Value | Count | Frequency (%) |
| 26 | 78 | |
| 27 | 32 | 0.3% |
| 28 | 29 | 0.3% |
| 29 | 56 | 0.6% |
| 30 | 70 | 0.7% |
| 31 | 91 | |
| 32 | 106 | |
| 33 | 127 | |
| 34 | 146 | |
| 35 | 184 |
| Value | Count | Frequency (%) |
| 73 | 1 | < 0.1% |
| 70 | 1 | < 0.1% |
| 68 | 2 | < 0.1% |
| 67 | 4 | < 0.1% |
| 66 | 2 | < 0.1% |
| 65 | 101 | |
| 64 | 43 | |
| 63 | 65 | |
| 62 | 93 | |
| 61 | 93 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.2 KiB |
| F | |
|---|---|
| M |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | M |
|---|---|
| 2nd row | F |
| 3rd row | M |
| 4th row | F |
| 5th row | M |
Common Values
| Value | Count | Frequency (%) |
| F | 5358 | |
| M | 4769 |
Length
Pie chart
| Value | Count | Frequency (%) |
| f | 5358 | |
| m | 4769 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.346203219 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 904 |
| Zeros (%) | 8.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 79.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.298908349 |
|---|---|
| Coefficient of variation (CV) | 0.5536214162 |
| Kurtosis | -0.6830166531 |
| Mean | 2.346203219 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.02082553562 |
| Sum | 23760 |
| Variance | 1.687162899 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 2732 | |
| 2 | 2655 | |
| 1 | 1838 | |
| 4 | 1574 | |
| 0 | 904 | 8.9% |
| 5 | 424 | 4.2% |
| Value | Count | Frequency (%) |
| 0 | 904 | 8.9% |
| 1 | 1838 | |
| 2 | 2655 | |
| 3 | 2732 | |
| 4 | 1574 | |
| 5 | 424 | 4.2% |
| Value | Count | Frequency (%) |
| 5 | 424 | 4.2% |
| 4 | 1574 | |
| 3 | 2732 | |
| 2 | 2655 | |
| 1 | 1838 | |
| 0 | 904 | 8.9% |
Education_Level
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.2 KiB |
| Graduate | |
|---|---|
| High School | |
| Unknown | |
| Uneducated | |
| College | |
| Other values (2) |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 8.939271255 |
| Min length | 7 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | High School |
|---|---|
| 2nd row | Graduate |
| 3rd row | Graduate |
| 4th row | High School |
| 5th row | Uneducated |
Common Values
| Value | Count | Frequency (%) |
| Graduate | 3128 | |
| High School | 2013 | |
| Unknown | 1519 | |
| Uneducated | 1487 | |
| College | 1013 | 10.0% |
| Post-Graduate | 516 | 5.1% |
| Doctorate | 451 | 4.5% |
Length
Pie chart
| Value | Count | Frequency (%) |
| graduate | 3128 | |
| high | 2013 | |
| school | 2013 | |
| unknown | 1519 | |
| uneducated | 1487 | |
| college | 1013 | 8.3% |
| post-graduate | 516 | 4.3% |
| doctorate | 451 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
Marital_Status
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.2 KiB |
| Married | |
|---|---|
| Single | |
| Unknown | |
| Divorced |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.684506764 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Married |
|---|---|
| 2nd row | Single |
| 3rd row | Married |
| 4th row | Unknown |
| 5th row | Married |
Common Values
| Value | Count | Frequency (%) |
| Married | 4687 | |
| Single | 3943 | |
| Unknown | 749 | 7.4% |
| Divorced | 748 | 7.4% |
Length
Pie chart
| Value | Count | Frequency (%) |
| married | 4687 | |
| single | 3943 | |
| unknown | 749 | 7.4% |
| divorced | 748 | 7.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.2 KiB |
| Less than $40K | |
|---|---|
| $40K - $60K | |
| $80K - $120K | |
| $60K - $80K | |
| Unknown |
Length
| Max length | 14 |
|---|---|
| Median length | 12 |
| Mean length | 11.4801027 |
| Min length | 7 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | $60K - $80K |
|---|---|
| 2nd row | Less than $40K |
| 3rd row | $80K - $120K |
| 4th row | Less than $40K |
| 5th row | $60K - $80K |
Common Values
| Value | Count | Frequency (%) |
| Less than $40K | 3561 | |
| $40K - $60K | 1790 | |
| $80K - $120K | 1535 | |
| $60K - $80K | 1402 | 13.8% |
| Unknown | 1112 | 11.0% |
| $120K + | 727 | 7.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 5454 | ||
| 40k | 5351 | |
| less | 3561 | |
| than | 3561 | |
| 60k | 3192 | |
| 80k | 2937 | |
| 120k | 2262 | |
| unknown | 1112 | 4.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.2 KiB |
| Blue | |
|---|---|
| Silver | 555 |
| Gold | 116 |
| Platinum | 20 |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.117507653 |
| Min length | 4 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Blue |
|---|---|
| 2nd row | Blue |
| 3rd row | Blue |
| 4th row | Blue |
| 5th row | Blue |
Common Values
| Value | Count | Frequency (%) |
| Blue | 9436 | |
| Silver | 555 | 5.5% |
| Gold | 116 | 1.1% |
| Platinum | 20 | 0.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| blue | 9436 | |
| silver | 555 | 5.5% |
| gold | 116 | 1.1% |
| platinum | 20 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
Months_on_book
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 44 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 35.9284092 |
| Minimum | 13 |
|---|---|
| Maximum | 56 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 79.2 KiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 31 |
| median | 36 |
| Q3 | 40 |
| 95-th percentile | 50 |
| Maximum | 56 |
| Range | 43 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 7.986416331 |
|---|---|
| Coefficient of variation (CV) | 0.2222869453 |
| Kurtosis | 0.4001001202 |
| Mean | 35.9284092 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | -0.1065653599 |
| Sum | 363847 |
| Variance | 63.78284581 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 36 | 2463 | |
| 37 | 358 | 3.5% |
| 34 | 353 | 3.5% |
| 38 | 347 | 3.4% |
| 39 | 341 | 3.4% |
| 40 | 333 | 3.3% |
| 31 | 318 | 3.1% |
| 35 | 317 | 3.1% |
| 33 | 305 | 3.0% |
| 30 | 300 | 3.0% |
| Other values (34) | 4692 |
| Value | Count | Frequency (%) |
| 13 | 70 | |
| 14 | 16 | 0.2% |
| 15 | 34 | 0.3% |
| 16 | 29 | 0.3% |
| 17 | 39 | 0.4% |
| 18 | 58 | |
| 19 | 63 | |
| 20 | 74 | |
| 21 | 83 | |
| 22 | 105 |
| Value | Count | Frequency (%) |
| 56 | 103 | |
| 55 | 42 | 0.4% |
| 54 | 53 | 0.5% |
| 53 | 78 | |
| 52 | 62 | 0.6% |
| 51 | 80 | |
| 50 | 96 | |
| 49 | 141 | |
| 48 | 162 | |
| 47 | 171 |
Total_Relationship_Count
Real number (ℝ≥0)
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.812580231 |
| Minimum | 1 |
|---|---|
| Maximum | 6 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 79.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.554407865 |
|---|---|
| Coefficient of variation (CV) | 0.4077049586 |
| Kurtosis | -1.006130507 |
| Mean | 3.812580231 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.162452415 |
| Sum | 38610 |
| Variance | 2.416183812 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 2305 | |
| 4 | 1912 | |
| 5 | 1891 | |
| 6 | 1866 | |
| 2 | 1243 | |
| 1 | 910 | 9.0% |
| Value | Count | Frequency (%) |
| 1 | 910 | 9.0% |
| 2 | 1243 | |
| 3 | 2305 | |
| 4 | 1912 | |
| 5 | 1891 | |
| 6 | 1866 |
| Value | Count | Frequency (%) |
| 6 | 1866 | |
| 5 | 1891 | |
| 4 | 1912 | |
| 3 | 2305 | |
| 2 | 1243 | |
| 1 | 910 | 9.0% |
Months_Inactive_12_mon
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.341167177 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 29 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 79.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.010622399 |
|---|---|
| Coefficient of variation (CV) | 0.4316745978 |
| Kurtosis | 1.098522614 |
| Mean | 2.341167177 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.633061129 |
| Sum | 23709 |
| Variance | 1.021357634 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 3846 | |
| 2 | 3282 | |
| 1 | 2233 | |
| 4 | 435 | 4.3% |
| 5 | 178 | 1.8% |
| 6 | 124 | 1.2% |
| 0 | 29 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 29 | 0.3% |
| 1 | 2233 | |
| 2 | 3282 | |
| 3 | 3846 | |
| 4 | 435 | 4.3% |
| 5 | 178 | 1.8% |
| 6 | 124 | 1.2% |
| Value | Count | Frequency (%) |
| 6 | 124 | 1.2% |
| 5 | 178 | 1.8% |
| 4 | 435 | 4.3% |
| 3 | 3846 | |
| 2 | 3282 | |
| 1 | 2233 | |
| 0 | 29 | 0.3% |
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.455317468 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 399 |
| Zeros (%) | 3.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 79.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 4 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.106225143 |
|---|---|
| Coefficient of variation (CV) | 0.4505426109 |
| Kurtosis | 0.0008626566254 |
| Mean | 2.455317468 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.01100562622 |
| Sum | 24865 |
| Variance | 1.223734066 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 3380 | |
| 2 | 3227 | |
| 1 | 1499 | |
| 4 | 1392 | |
| 0 | 399 | 3.9% |
| 5 | 176 | 1.7% |
| 6 | 54 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 399 | 3.9% |
| 1 | 1499 | |
| 2 | 3227 | |
| 3 | 3380 | |
| 4 | 1392 | |
| 5 | 176 | 1.7% |
| 6 | 54 | 0.5% |
| Value | Count | Frequency (%) |
| 6 | 54 | 0.5% |
| 5 | 176 | 1.7% |
| 4 | 1392 | |
| 3 | 3380 | |
| 2 | 3227 | |
| 1 | 1499 | |
| 0 | 399 | 3.9% |
| Distinct | 6205 |
|---|---|
| Distinct (%) | 61.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8631.953698 |
| Minimum | 1438.3 |
|---|---|
| Maximum | 34516 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 79.2 KiB |
Quantile statistics
| Minimum | 1438.3 |
|---|---|
| 5-th percentile | 1438.51 |
| Q1 | 2555 |
| median | 4549 |
| Q3 | 11067.5 |
| 95-th percentile | 34516 |
| Maximum | 34516 |
| Range | 33077.7 |
| Interquartile range (IQR) | 8512.5 |
Descriptive statistics
| Standard deviation | 9088.77665 |
|---|---|
| Coefficient of variation (CV) | 1.052922313 |
| Kurtosis | 1.808989336 |
| Mean | 8631.953698 |
| Median Absolute Deviation (MAD) | 2593 |
| Skewness | 1.666725808 |
| Sum | 87415795.1 |
| Variance | 82605861 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 34516 | 508 | 5.0% |
| 1438.3 | 507 | 5.0% |
| 9959 | 18 | 0.2% |
| 15987 | 18 | 0.2% |
| 23981 | 12 | 0.1% |
| 2490 | 11 | 0.1% |
| 6224 | 11 | 0.1% |
| 3735 | 11 | 0.1% |
| 7469 | 10 | 0.1% |
| 2069 | 8 | 0.1% |
| Other values (6195) | 9013 |
| Value | Count | Frequency (%) |
| 1438.3 | 507 | |
| 1439 | 2 | < 0.1% |
| 1440 | 1 | < 0.1% |
| 1441 | 2 | < 0.1% |
| 1442 | 1 | < 0.1% |
| 1443 | 3 | < 0.1% |
| 1446 | 1 | < 0.1% |
| 1449 | 2 | < 0.1% |
| 1451 | 2 | < 0.1% |
| 1452 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 34516 | 508 | |
| 34496 | 1 | < 0.1% |
| 34458 | 1 | < 0.1% |
| 34427 | 1 | < 0.1% |
| 34198 | 1 | < 0.1% |
| 34173 | 1 | < 0.1% |
| 34162 | 1 | < 0.1% |
| 34140 | 1 | < 0.1% |
| 34058 | 1 | < 0.1% |
| 34010 | 1 | < 0.1% |
Total_Revolving_Bal
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 1974 |
|---|---|
| Distinct (%) | 19.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1162.814061 |
| Minimum | 0 |
|---|---|
| Maximum | 2517 |
| Zeros | 2470 |
| Zeros (%) | 24.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 79.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 359 |
| median | 1276 |
| Q3 | 1784 |
| 95-th percentile | 2517 |
| Maximum | 2517 |
| Range | 2517 |
| Interquartile range (IQR) | 1425 |
Descriptive statistics
| Standard deviation | 814.9873352 |
|---|---|
| Coefficient of variation (CV) | 0.7008750257 |
| Kurtosis | -1.145991782 |
| Mean | 1162.814061 |
| Median Absolute Deviation (MAD) | 591 |
| Skewness | -0.1488372503 |
| Sum | 11775818 |
| Variance | 664204.3566 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2470 | 24.4% |
| 2517 | 508 | 5.0% |
| 1965 | 12 | 0.1% |
| 1480 | 12 | 0.1% |
| 1434 | 11 | 0.1% |
| 1664 | 11 | 0.1% |
| 1720 | 11 | 0.1% |
| 1590 | 10 | 0.1% |
| 1542 | 10 | 0.1% |
| 1528 | 10 | 0.1% |
| Other values (1964) | 7062 |
| Value | Count | Frequency (%) |
| 0 | 2470 | |
| 132 | 1 | < 0.1% |
| 134 | 1 | < 0.1% |
| 145 | 1 | < 0.1% |
| 154 | 1 | < 0.1% |
| 157 | 1 | < 0.1% |
| 159 | 2 | < 0.1% |
| 168 | 2 | < 0.1% |
| 170 | 1 | < 0.1% |
| 186 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2517 | 508 | |
| 2514 | 3 | < 0.1% |
| 2513 | 1 | < 0.1% |
| 2512 | 2 | < 0.1% |
| 2511 | 1 | < 0.1% |
| 2509 | 2 | < 0.1% |
| 2508 | 2 | < 0.1% |
| 2507 | 4 | < 0.1% |
| 2506 | 1 | < 0.1% |
| 2505 | 3 | < 0.1% |
Avg_Open_To_Buy
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 6813 |
|---|---|
| Distinct (%) | 67.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7469.139637 |
| Minimum | 3 |
|---|---|
| Maximum | 34516 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 79.2 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 480.3 |
| Q1 | 1324.5 |
| median | 3474 |
| Q3 | 9859 |
| 95-th percentile | 32183.4 |
| Maximum | 34516 |
| Range | 34513 |
| Interquartile range (IQR) | 8534.5 |
Descriptive statistics
| Standard deviation | 9090.685324 |
|---|---|
| Coefficient of variation (CV) | 1.217099394 |
| Kurtosis | 1.798617296 |
| Mean | 7469.139637 |
| Median Absolute Deviation (MAD) | 2665 |
| Skewness | 1.661696546 |
| Sum | 75639977.1 |
| Variance | 82640559.65 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1438.3 | 324 | 3.2% |
| 34516 | 98 | 1.0% |
| 31999 | 26 | 0.3% |
| 787 | 8 | 0.1% |
| 701 | 7 | 0.1% |
| 713 | 7 | 0.1% |
| 953 | 7 | 0.1% |
| 463 | 7 | 0.1% |
| 990 | 6 | 0.1% |
| 788 | 6 | 0.1% |
| Other values (6803) | 9631 |
| Value | Count | Frequency (%) |
| 3 | 1 | |
| 10 | 1 | |
| 14 | 2 | |
| 15 | 1 | |
| 24 | 1 | |
| 28 | 1 | |
| 29 | 1 | |
| 36 | 1 | |
| 39 | 2 | |
| 41 | 2 |
| Value | Count | Frequency (%) |
| 34516 | 98 | |
| 34362 | 1 | < 0.1% |
| 34302 | 1 | < 0.1% |
| 34300 | 1 | < 0.1% |
| 34297 | 1 | < 0.1% |
| 34286 | 1 | < 0.1% |
| 34238 | 1 | < 0.1% |
| 34227 | 1 | < 0.1% |
| 34140 | 1 | < 0.1% |
| 34119 | 1 | < 0.1% |
| Distinct | 1158 |
|---|---|
| Distinct (%) | 11.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.7599406537 |
| Minimum | 0 |
|---|---|
| Maximum | 3.397 |
| Zeros | 5 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 79.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.463 |
| Q1 | 0.631 |
| median | 0.736 |
| Q3 | 0.859 |
| 95-th percentile | 1.103 |
| Maximum | 3.397 |
| Range | 3.397 |
| Interquartile range (IQR) | 0.228 |
Descriptive statistics
| Standard deviation | 0.2192067692 |
|---|---|
| Coefficient of variation (CV) | 0.288452484 |
| Kurtosis | 9.993501179 |
| Mean | 0.7599406537 |
| Median Absolute Deviation (MAD) | 0.114 |
| Skewness | 1.732063411 |
| Sum | 7695.919 |
| Variance | 0.04805160768 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.791 | 36 | 0.4% |
| 0.712 | 34 | 0.3% |
| 0.743 | 34 | 0.3% |
| 0.718 | 33 | 0.3% |
| 0.735 | 33 | 0.3% |
| 0.744 | 32 | 0.3% |
| 0.699 | 32 | 0.3% |
| 0.722 | 32 | 0.3% |
| 0.731 | 31 | 0.3% |
| 0.631 | 31 | 0.3% |
| Other values (1148) | 9799 |
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 0.01 | 1 | < 0.1% |
| 0.018 | 1 | < 0.1% |
| 0.046 | 1 | < 0.1% |
| 0.061 | 2 | < 0.1% |
| 0.072 | 1 | < 0.1% |
| 0.101 | 1 | < 0.1% |
| 0.12 | 1 | < 0.1% |
| 0.153 | 1 | < 0.1% |
| 0.163 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3.397 | 1 | |
| 3.355 | 1 | |
| 2.675 | 1 | |
| 2.594 | 1 | |
| 2.368 | 1 | |
| 2.357 | 1 | |
| 2.316 | 1 | |
| 2.282 | 1 | |
| 2.275 | 1 | |
| 2.271 | 1 |
Total_Trans_Amt
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 5033 |
|---|---|
| Distinct (%) | 49.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4404.086304 |
| Minimum | 510 |
|---|---|
| Maximum | 18484 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 79.2 KiB |
Quantile statistics
| Minimum | 510 |
|---|---|
| 5-th percentile | 1283.3 |
| Q1 | 2155.5 |
| median | 3899 |
| Q3 | 4741 |
| 95-th percentile | 14212 |
| Maximum | 18484 |
| Range | 17974 |
| Interquartile range (IQR) | 2585.5 |
Descriptive statistics
| Standard deviation | 3397.129254 |
|---|---|
| Coefficient of variation (CV) | 0.7713584656 |
| Kurtosis | 3.894023406 |
| Mean | 4404.086304 |
| Median Absolute Deviation (MAD) | 1308 |
| Skewness | 2.041003403 |
| Sum | 44600182 |
| Variance | 11540487.17 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4253 | 11 | 0.1% |
| 4509 | 11 | 0.1% |
| 4518 | 10 | 0.1% |
| 2229 | 10 | 0.1% |
| 4220 | 9 | 0.1% |
| 4869 | 9 | 0.1% |
| 4037 | 9 | 0.1% |
| 4313 | 9 | 0.1% |
| 4498 | 9 | 0.1% |
| 4042 | 9 | 0.1% |
| Other values (5023) | 10031 |
| Value | Count | Frequency (%) |
| 510 | 1 | |
| 530 | 1 | |
| 563 | 1 | |
| 569 | 1 | |
| 594 | 1 | |
| 596 | 1 | |
| 597 | 1 | |
| 602 | 1 | |
| 615 | 1 | |
| 643 | 1 |
| Value | Count | Frequency (%) |
| 18484 | 1 | |
| 17995 | 1 | |
| 17744 | 1 | |
| 17634 | 1 | |
| 17628 | 1 | |
| 17498 | 1 | |
| 17437 | 1 | |
| 17390 | 1 | |
| 17350 | 1 | |
| 17258 | 1 |
Total_Trans_Ct
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATION| Distinct | 126 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 64.85869458 |
| Minimum | 10 |
|---|---|
| Maximum | 139 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 79.2 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 28 |
| Q1 | 45 |
| median | 67 |
| Q3 | 81 |
| 95-th percentile | 105 |
| Maximum | 139 |
| Range | 129 |
| Interquartile range (IQR) | 36 |
Descriptive statistics
| Standard deviation | 23.47257045 |
|---|---|
| Coefficient of variation (CV) | 0.3619032206 |
| Kurtosis | -0.3671632411 |
| Mean | 64.85869458 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | 0.1536730685 |
| Sum | 656824 |
| Variance | 550.9615635 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 81 | 208 | 2.1% |
| 71 | 203 | 2.0% |
| 75 | 203 | 2.0% |
| 69 | 202 | 2.0% |
| 82 | 202 | 2.0% |
| 76 | 198 | 2.0% |
| 77 | 197 | 1.9% |
| 70 | 193 | 1.9% |
| 74 | 190 | 1.9% |
| 78 | 190 | 1.9% |
| Other values (116) | 8141 |
| Value | Count | Frequency (%) |
| 10 | 4 | < 0.1% |
| 11 | 2 | < 0.1% |
| 12 | 4 | < 0.1% |
| 13 | 5 | < 0.1% |
| 14 | 9 | 0.1% |
| 15 | 16 | |
| 16 | 13 | |
| 17 | 13 | |
| 18 | 23 | |
| 19 | 11 |
| Value | Count | Frequency (%) |
| 139 | 1 | < 0.1% |
| 138 | 1 | < 0.1% |
| 134 | 1 | < 0.1% |
| 132 | 1 | < 0.1% |
| 131 | 6 | |
| 130 | 5 | |
| 129 | 6 | |
| 128 | 10 | |
| 127 | 12 | |
| 126 | 10 |
| Distinct | 830 |
|---|---|
| Distinct (%) | 8.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.7122223758 |
| Minimum | 0 |
|---|---|
| Maximum | 3.714 |
| Zeros | 7 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 79.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.368 |
| Q1 | 0.582 |
| median | 0.702 |
| Q3 | 0.818 |
| 95-th percentile | 1.069 |
| Maximum | 3.714 |
| Range | 3.714 |
| Interquartile range (IQR) | 0.236 |
Descriptive statistics
| Standard deviation | 0.2380860913 |
|---|---|
| Coefficient of variation (CV) | 0.3342861716 |
| Kurtosis | 15.6892929 |
| Mean | 0.7122223758 |
| Median Absolute Deviation (MAD) | 0.119 |
| Skewness | 2.064030568 |
| Sum | 7212.676 |
| Variance | 0.05668498689 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.667 | 171 | 1.7% |
| 1 | 166 | 1.6% |
| 0.5 | 161 | 1.6% |
| 0.75 | 156 | 1.5% |
| 0.6 | 113 | 1.1% |
| 0.8 | 101 | 1.0% |
| 0.714 | 92 | 0.9% |
| 0.833 | 85 | 0.8% |
| 0.778 | 69 | 0.7% |
| 0.625 | 63 | 0.6% |
| Other values (820) | 8950 |
| Value | Count | Frequency (%) |
| 0 | 7 | |
| 0.028 | 1 | < 0.1% |
| 0.029 | 1 | < 0.1% |
| 0.038 | 1 | < 0.1% |
| 0.053 | 1 | < 0.1% |
| 0.059 | 2 | < 0.1% |
| 0.062 | 1 | < 0.1% |
| 0.074 | 1 | < 0.1% |
| 0.077 | 3 | |
| 0.091 | 3 |
| Value | Count | Frequency (%) |
| 3.714 | 1 | < 0.1% |
| 3.571 | 1 | < 0.1% |
| 3.5 | 1 | < 0.1% |
| 3.25 | 1 | < 0.1% |
| 3 | 2 | |
| 2.875 | 1 | < 0.1% |
| 2.75 | 1 | < 0.1% |
| 2.571 | 1 | < 0.1% |
| 2.5 | 3 | |
| 2.429 | 1 | < 0.1% |
Avg_Utilization_Ratio
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 964 |
|---|---|
| Distinct (%) | 9.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2748935519 |
| Minimum | 0 |
|---|---|
| Maximum | 0.999 |
| Zeros | 2470 |
| Zeros (%) | 24.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 79.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.023 |
| median | 0.176 |
| Q3 | 0.503 |
| 95-th percentile | 0.793 |
| Maximum | 0.999 |
| Range | 0.999 |
| Interquartile range (IQR) | 0.48 |
Descriptive statistics
| Standard deviation | 0.2756914693 |
|---|---|
| Coefficient of variation (CV) | 1.002902641 |
| Kurtosis | -0.7949719515 |
| Mean | 0.2748935519 |
| Median Absolute Deviation (MAD) | 0.176 |
| Skewness | 0.7180079968 |
| Sum | 2783.847 |
| Variance | 0.07600578622 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2470 | 24.4% |
| 0.073 | 44 | 0.4% |
| 0.057 | 33 | 0.3% |
| 0.048 | 32 | 0.3% |
| 0.06 | 30 | 0.3% |
| 0.061 | 29 | 0.3% |
| 0.045 | 29 | 0.3% |
| 0.059 | 28 | 0.3% |
| 0.069 | 28 | 0.3% |
| 0.053 | 27 | 0.3% |
| Other values (954) | 7377 |
| Value | Count | Frequency (%) |
| 0 | 2470 | |
| 0.004 | 1 | < 0.1% |
| 0.005 | 1 | < 0.1% |
| 0.006 | 3 | < 0.1% |
| 0.007 | 1 | < 0.1% |
| 0.008 | 2 | < 0.1% |
| 0.009 | 1 | < 0.1% |
| 0.01 | 1 | < 0.1% |
| 0.011 | 1 | < 0.1% |
| 0.012 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0.999 | 1 | < 0.1% |
| 0.995 | 1 | < 0.1% |
| 0.994 | 1 | < 0.1% |
| 0.992 | 1 | < 0.1% |
| 0.99 | 1 | < 0.1% |
| 0.988 | 1 | < 0.1% |
| 0.987 | 1 | < 0.1% |
| 0.985 | 1 | < 0.1% |
| 0.984 | 1 | < 0.1% |
| 0.983 | 4 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 79.2 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 8500 | |
| 1 | 1627 | 16.1% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 8500 | |
| 1 | 1627 | 16.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| Customer_Age | Sex | Dependent_Count | Education_Level | Marital_Status | Income_Category | Card_Category | Months_on_book | Total_Relationship_Count | Months_Inactive_12_mon | Contacts_Count_12_mon | Credit_Limit | Total_Revolving_Bal | Avg_Open_To_Buy | Total_Amt_Chng_Q4_Q1 | Total_Trans_Amt | Total_Trans_Ct | Total_Ct_Chng_Q4_Q1 | Avg_Utilization_Ratio | Churn | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 45 | M | 3 | High School | Married | $60K - $80K | Blue | 39 | 5 | 1 | 3 | 12691.0 | 777 | 11914.0 | 1.335 | 1144 | 42 | 1.625 | 0.061 | 0 |
| 1 | 49 | F | 5 | Graduate | Single | Less than $40K | Blue | 44 | 6 | 1 | 2 | 8256.0 | 864 | 7392.0 | 1.541 | 1291 | 33 | 3.714 | 0.105 | 0 |
| 2 | 51 | M | 3 | Graduate | Married | $80K - $120K | Blue | 36 | 4 | 1 | 0 | 3418.0 | 0 | 3418.0 | 2.594 | 1887 | 20 | 2.333 | 0.000 | 0 |
| 3 | 40 | F | 4 | High School | Unknown | Less than $40K | Blue | 34 | 3 | 4 | 1 | 3313.0 | 2517 | 796.0 | 1.405 | 1171 | 20 | 2.333 | 0.760 | 0 |
| 4 | 40 | M | 3 | Uneducated | Married | $60K - $80K | Blue | 21 | 5 | 1 | 0 | 4716.0 | 0 | 4716.0 | 2.175 | 816 | 28 | 2.500 | 0.000 | 0 |
| 5 | 44 | M | 2 | Graduate | Married | $40K - $60K | Blue | 36 | 3 | 1 | 2 | 4010.0 | 1247 | 2763.0 | 1.376 | 1088 | 24 | 0.846 | 0.311 | 0 |
| 6 | 51 | M | 4 | Unknown | Married | $120K + | Gold | 46 | 6 | 1 | 3 | 34516.0 | 2264 | 32252.0 | 1.975 | 1330 | 31 | 0.722 | 0.066 | 0 |
| 7 | 32 | M | 0 | High School | Unknown | $60K - $80K | Silver | 27 | 2 | 2 | 2 | 29081.0 | 1396 | 27685.0 | 2.204 | 1538 | 36 | 0.714 | 0.048 | 0 |
| 8 | 37 | M | 3 | Uneducated | Single | $60K - $80K | Blue | 36 | 5 | 2 | 0 | 22352.0 | 2517 | 19835.0 | 3.355 | 1350 | 24 | 1.182 | 0.113 | 0 |
| 9 | 48 | M | 2 | Graduate | Single | $80K - $120K | Blue | 36 | 6 | 3 | 3 | 11656.0 | 1677 | 9979.0 | 1.524 | 1441 | 32 | 0.882 | 0.144 | 0 |
Last rows
| Customer_Age | Sex | Dependent_Count | Education_Level | Marital_Status | Income_Category | Card_Category | Months_on_book | Total_Relationship_Count | Months_Inactive_12_mon | Contacts_Count_12_mon | Credit_Limit | Total_Revolving_Bal | Avg_Open_To_Buy | Total_Amt_Chng_Q4_Q1 | Total_Trans_Amt | Total_Trans_Ct | Total_Ct_Chng_Q4_Q1 | Avg_Utilization_Ratio | Churn | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 10117 | 57 | M | 2 | Graduate | Married | $80K - $120K | Blue | 40 | 6 | 3 | 4 | 17925.0 | 1909 | 16016.0 | 0.712 | 17498 | 111 | 0.820 | 0.106 | 0 |
| 10118 | 50 | M | 1 | Unknown | Unknown | $80K - $120K | Blue | 36 | 6 | 3 | 4 | 9959.0 | 952 | 9007.0 | 0.825 | 10310 | 63 | 1.100 | 0.096 | 1 |
| 10119 | 55 | F | 3 | Uneducated | Single | Unknown | Blue | 47 | 4 | 3 | 3 | 14657.0 | 2517 | 12140.0 | 0.166 | 6009 | 53 | 0.514 | 0.172 | 1 |
| 10120 | 54 | M | 1 | High School | Single | $60K - $80K | Blue | 34 | 5 | 2 | 0 | 13940.0 | 2109 | 11831.0 | 0.660 | 15577 | 114 | 0.754 | 0.151 | 0 |
| 10121 | 56 | F | 1 | Graduate | Single | Less than $40K | Blue | 50 | 4 | 1 | 4 | 3688.0 | 606 | 3082.0 | 0.570 | 14596 | 120 | 0.791 | 0.164 | 0 |
| 10122 | 50 | M | 2 | Graduate | Single | $40K - $60K | Blue | 40 | 3 | 2 | 3 | 4003.0 | 1851 | 2152.0 | 0.703 | 15476 | 117 | 0.857 | 0.462 | 0 |
| 10123 | 41 | M | 2 | Unknown | Divorced | $40K - $60K | Blue | 25 | 4 | 2 | 3 | 4277.0 | 2186 | 2091.0 | 0.804 | 8764 | 69 | 0.683 | 0.511 | 1 |
| 10124 | 44 | F | 1 | High School | Married | Less than $40K | Blue | 36 | 5 | 3 | 4 | 5409.0 | 0 | 5409.0 | 0.819 | 10291 | 60 | 0.818 | 0.000 | 1 |
| 10125 | 30 | M | 2 | Graduate | Unknown | $40K - $60K | Blue | 36 | 4 | 3 | 3 | 5281.0 | 0 | 5281.0 | 0.535 | 8395 | 62 | 0.722 | 0.000 | 1 |
| 10126 | 43 | F | 2 | Graduate | Married | Less than $40K | Silver | 25 | 6 | 2 | 4 | 10388.0 | 1961 | 8427.0 | 0.703 | 10294 | 61 | 0.649 | 0.189 | 1 |